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Abstract. —Neochrome (NEO) is a particular chimeric photoreceptor that can respond to red/far 
red as well as blue light. Using this photoreceptor, some cryptogams may enhance light sensitivity 
at low light environment. However, the NEO sequence information is extremely limited. Using 
inverse PCR and single oligonucleotide nested PCR genome-walking approaches, the full-length 
gencmic sequences of iVFO genes were determined for three fern species Coniogramme intermedia 
var. glabra, PlagiogiTia distinctissima and Pronephrium lakhimpurnense. Sequence and structure 
analysis results indicate that they possess a conserved structure and organization. Introns are 
absent in three genes. They each contain an 4332. 4308 and 4317-bp ORF (Open reading frame), 
encoding a deduced protein with 1443, 1435 and 1438 amino acid residues. Their N-terminus 
consists of a putative functional phytochrome sensory module including PAS (Per / Amt / Sim). GAF 
(cGMP specific phosphodiesterase / adenylate cyclases / formate hydrogen lyase transcription) and 
PHY (phytochrome) domains, whereas the C-terminus contains a nearly complete pholotropin with 
two LOV (Light, Oxygen and Voltage-sensing) and one STKc (Serine/Threonine protein kinases, 
catahlic) domains. Phylogenetic analyses revealed that they are evolutionarily close to their 
homolog AcNEO found in Adiantum capillus-veneris. The 3D structures of the functional domains 
were computer-modeled, and the pivotal amino acid sites for each functional domain were also 
identified. This study represents the first comparative analysis of NEO nucleotide sequences and 
structures in ferns. 
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Light is one of the most essential environmental factors for plants that 
perform photosynthesis. Plants have developed a variety of photoreceptors 
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that perceive light signals from UV-B to far-red regions (Pepper et ai, 2001; 
Quail, 2002). Ph 5 rtochromes predominantly absorb red and far-red light, using 
a bilin (or linear tetrapyrrole) chromophore for light detection in seed plants 
and in cryptogams (Quail, 2002), and phototropins are UV-A/blue light 
receptors (Briggs and Christie, 2002; Folta and Spalding, 2001). Many 
photomorphogenic responses mediated by these photoreceptors, for example, 
circadian clocks, shade-avoidance, phototropism, stomatal opening and 
chloroplast relocation, have been described (Christie, 2007; Rockwell et ai, 
2006). Most plant species from algae to flowering plants use blue light for 
inducing phototropism and chloroplast movement (Campbell and Liscum, 
2001; Sakai et al., 2001), but some ferns and green algae use red as well as blue 
light for the regulation of these responses (Kagawa and Wada, 1994; Wada and 
Kadota, 1989). In the case of Adiantum capilius-veneris L., a particular 
chimeric photoreceptor named phytochrome 3 (later renamed as neochrome. 
NEO) has been identified (Nozue et al, 1998; Suetsugu et al, 2005), which can 
sense red and blue light (RL and BL) simultaneously to regulate phototropism 
and chloroplast movement (Kanegae et al, 2006; Kawai et al, 2003). 

Functional domains of NEO have been characterized through biochemical, 
genetic, and bioinformatic approaches (Crosson and Moffat, 2001; Pandini and 
Bonati, 2005; Tokutomi et al, 2008; Wagner et al, 2005), Its N-terminus is quite 
similar to canonical phytochromes, whereas the C-terminus is made up of a 
complete phototropin motif. It has been suggested that both phytochrome and 
phototropin photosensory systems coexist in NEO, conferring it the potential to 
process red and blue light signals synergistically, which increases the light 
sensitivity (Kanegae et al, 2006). Many polypod ferns preferentially grow in 
shady habitats under the canopy of dense angiosperm forests. For them, this 
dual-channel photoreceptor has been proposed to play a central role in 
developing adaptiv'^e mechanisms and promoting species diversification under 
low-light angiosperm forest niches (Kawai et al, 2003; Schneider et al, 2004). 

To date, similar NEO-like chimeric photoreceptors have only been identified 
in a few cryptogams; two were isolated from the green alga Mougeotia scalaris 
Hass. (Suetsugu et al, 2005), and four from ferns (one full-length sequence 
from Adiantum and three short fragments from Dryopteris filix-mas (L.) Schott, 
Onoclea sensibilis L. and Hypolepis punctata Mett., respectively) (Kawai et al, 
2003). Thus. NEO sequence information is still extremely limited. Here we 
report the isolation and sequence analyses of the full-length genomic NEO 
sequences from two polypod ferns Coniogramme intermedia Hieron. var. 
glabra Ching, Pronephrium lakhimpurnense (Ros.) Holtt., and one tree fern 
Plagiogyria distinctissima Ching. Phylogenetic analyses were conducted to 
discern the gene origin, and the secondary and tertiary structures of deduced 
proteins were further predicted using bioinformatics methods. 


Materials and Methods 

Plant materials and DNA isolation. —All ferns were cultivated at Wuhan 
Botanical Garden, Chinese Academy of Sciences, Hubei, China. For each plant. 



YANG ET AL.: RED/BLUE LIGHT CHIMERIC PHOTORECEPTOR GENES 


3 


Table 1. Primers used in diversified genome-walking experiments. 


Methods 

Oligo name 

IPCR 

5' GDP Fl 
5' GSP Rl 
5' GSP F2 
5' GSP R2 

Universal PCR 

5' GSP F3 
5' GSP R3 
GSP Fl 
GSP Rl 
GDP F2 
GDP R2 
GSP F3 
GSP R3 

SON 

3' GSP 1 

3' GSP 2 

IPCR 

3' GSP F2 
3' GSP R2 
3' GSP F3 


3' GSP R3 


Sequence (5'—*3') 

A TC ACGCCC AATTCCG A Y( A/C/T)TTC 
C AC AGCATTGTCTGCGTCA T A AG 
GTTCTGGTTCAGGTCGCATAC 
AGAAGACCCCACCAGGCTAATG 

TC ACC A ATGTCCGCC AAGA CCA AG 
TAGGCCATCACCCGATCGTAGC 
G ATGGCCT AC A AGTTCC ACGAG 
TGGCAGGGGTAGTGATAGA 
ATCACGCCC AATTCCG A Y( A/C/T)TTC 
TGH(C/T)ACTCCGATGAAGTACTGGA 
GAGGTGCTCGGTGAGAACTG 
CTCGGGCTTGAGGTCTCGGTAG 

TGAGGAATCTGCCCGCTTCTACGC 

CCACTGCATGGGTGTCATCTAC 

TCAAGCCCGAGAACCTGCTACTG 
A TCCCTCCGTGT A CTTGCT A A CC 
GAGGACTGGTGGGCGTTGGGAATC 
CGGTAAGATTGAAGAACCCTGTG 


® GSP referred to genomic specific primer, while GDP referred to degenerated primer. 


three pieces of fresh leaves were collected to isolate genomic DNA following 
modified CTAB protocols (Sambrook et al., 2001). 

Cloning and sequencing of full-length NEO gene from C. intermedia 
var. glabra. —Full-length genomic NEO genes were amplified using diversified 
genome-walking PCR approaches. Primers used are listed in Table 1. Specific 
and degenerate primers were designed to amplify the internal NEO regions of 
C. intermedia var. glabra based on the conserved ph)dochrome chromophore- 
binding and phototropin regions of between Adiantum NEO (Accession No.: 
AB012082) and M. scalaris homologous sequences (Accession No.; AB206967, 
AB206966). Using the primer pairs GSP Fl/Rl, GDP F2/R2 and GSP F3/R3, 
universal PCR amplifications were performed in 20 pi of reaction mixture 
containing 50 inM KCl, 10 mM Tris-HCl (pH 9.0), 0.1% Triton X-100, 2.0 mM 
MgCl 2 . 0.2 mM each deoxynucleoside triphosphate, 1 U Taq DNA polymerase, 
0.4 pM primers, 50 ng genomic DNA, and DNA-free water. PCR amplifications 
were carried out under the following conditions: the template was denatured 
at 94 C for 5 min followed by 35 cycles of amplification (94 C for 30 s, 62'C for 
30 s and 72 C for 90 s) and a final elongation step at 72 C for 10 min. 

Meanwhile, Inverse PCR (IPCR; Huang, 1997) was used to obtain the 5'- 
portion of the gene. The genomic DNA of C. intermedia var. glabra, was 
digested with Apol restriction enz\mies and ligated with T4 DNA ligase. Then 
the ligation solution was used as the PCR template. The first round and nested 
PCR amplification were performed with primer pairs 5' GDP Fl/5' GSP Rl, 5' 
GSP F2/5' GSP R2 and with LA TaqE following the procedures the same as 
described above except that the annealing temperature was 56'C. 
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The 3' NEO sequence of C. intermedia var. glabra was cloned by single 
oligonucleotide nested PCR (SON PGR; Antal et al, 2004) using primers 3' GSP 
1 and 3' GSP 2 (Table 1) for the first and second round PCR. separately. The 
DNA template for the first round PCR was 50 ng of genomic DNA. The PCR 
program was: one denaturation step for 3 min at 94 C, five cycles of 
amplification (30 s at 94 C, 1 min at 60 C, 3 min at 72'C) followed by one 
ramping step (30 s at 94 C, 3 min at 29 C, 3 min ramp to 72"C. 2,5 min at 72 C) 
and 60 new amplification cycles (30 s at 94 C, 1 min at 60"C, 2.5 min at 72-C) 
with a 10-min extension step at 72 'C. The second round PCR used 2 pi of a 1:50 
dilution of the first round PCR product as the template, and was performed 
with the following conditions: 5-min denaturation at 94 C, 30 cycles of 
amplification (30 s at 94 C, 1 min at 60 C, 2.5 min at 72"C) and 10-min terminal 
extension at 72 C. 

All PCR products amplified by the above steps were recovered from 1% 
agarose/TAE gels and purified with a quick PCR Purification Kit (Promega), 
and then cloned into PMD19-T vectors (Takara). The plasmids were 
transformed to Escherichia coli strain DH5a. Based on the color reaction 
using Xgal-IPTG system and PCR identification, white transformants were 
selected and positive clones were sequenced with an ABI PRISM 3730 DNA 
analyzer. Three clones were sequenced for each amplicon to control Taq 
polymerase errors. The overlapping sequences from various amplification 
steps were assembled in a single contig. 

Isolation of NEO genes from P. distinctissima and P. lakhimpurnense .—The 
internal regions of NEO genes for P. distinctissima and P. lakhimpurnense 
were amplified by using the same primers and PCR procedures as C. 
intermedia var. glabra. Primers 5' GSP F3 and 5' GSP R3 were used to obtain 
the 5' regions. The former was designed based on the 5' conservative NEO 
sequences between A. capillus-veneris and C. intermedia var. glabra, while the 
latter according to the internal regions obtained above. The 3' NEO region of P. 
distinctissima was obtained using the same SON PCR procedures and primers 
as those of C. intermedia var. glabra, whereas the 3' sequence of P. 
lakhimpurnense NEO was gained by IPCR with Apol restriction enzyme 
system and the primer pairs 3' GSP F2/ R2 and 3'GSP F3/ R3. Amplified PCR 
products were purified, cloned, sequenced and assembled as described above. 

Sequence accession numbers.—NEO gene sequences were submitted to 
NCBI database with the Accession Numbers FJ501964—FJ501966. 

Sequence analysis and prediction of the NEO structure. —The open reading 
frame (ORF) finder and BLAST tool were used to predict coding sequence and 
estimate similarities of NEO with other genes. The translated amino acid 
sequences of each gene were analyzed using the program Protparam on Expasy 
website. Protein domain search, prediction of protein hydrophobicity, 
transmembrane structures emd signal peptide cleavage sites were performed 
mdth Prosite (Hofmann et al., 1999), ProtScale (Gasteiger et al., 2005), TMpred 
(Hofmann and Stoffel, 1993) and SignalP (Emanuelsson et al., 2007) programs, 
respectively. SOPMA (Geourjon and Deleage, 1995) analyses were carried out 
to predict the secondary structure. Three-dimensional (3D) structures of 
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functional domains were computer-modeled by SWISS-MODEL (http://www. 
isb-sib.ch/) program. 3D structures of PAS (Per / Arnt / Sim) and GAP (cGMP 
specific phosphodiesterase / adenylate cyclases / formate hydrogen lyase 

o'" 

transcription) were predicted based on the 1.45 A crystal structure of 
Deinococcus radiodurans Phytochrome (PDB code: 2o9c). The LOV (Light, 
Oxygen and Voltage-sensing) 1, LOV2 and STKc (Serine/Threonine protein 
kinases, catalytic) domains were computer-modeled using fungal photorecep¬ 
tor VVD (PDB code: 3d72A; Resolution: 1.65 A), Adiantum NEO LOV2 (PDB 

D 

code: IjnuB; 2.60 A) and Human Myotonic Dystrophy Protein Kinase (PDB 

o 

code: 2vd5A; 2.80 A) as templates, respectively. Models were displayed with 
the PyMol program (DeLano, 2002). 

Sequence alignment and phylogenetic analysis. —Other than the sequences 
cloned in this work, sequences used for comparison and phylogenetic analysis 
were downloaded from Genbank. Complete species names and accession 
numbers are listed in Table 2. Based on the predicted structure of NEO, we 
divided the NEO sequences data into N-terminal phytochrome-like and C- 
terminal phototropin-like regions. Sequences were aligned using Clustal X 2.0 
and then adjusted manually. The phylogenetic trees were constructed from the 
aligned coding DNA using the maximum likelihood (ML) method as 
implemented in PHYML (Guindon and Gascuel, 2003). Before ML analyses, 
we first identified the best fit model of molecular evolution for the N- and C- 
terminal datasets using the Akaike Information Criterion (AIC) as implemented 
in ModelTest (Posada, 2003). The model of the general time reversible (GTR) + 
4G (four categories of Gamma substitution rates) -i- I (invariable sites) model 
were applied to each datasets. A nonparametric bootstrap test was conducted 
with 1000 replicates. 


Results and Disclission 

Cloning and sequence analysis of the complete coding sequences. —Using 
diversified genome-walking PGR approaches and the primers in Table 1, the 
NEO fragments were amplified from three fern species (Fig. 1). After cloning 
and sequencing, the assembling results displayed that the NEO genes from 
three fern species, C. intermedia var. glabra, P. distinctissima, and P. 
lakhimpurnense, are 4937 bp, 4424 bp, and 4456 bp in length, respectively. 
Comparative analyses with the full-length A. capillus-veneris L. NEO [AcNEO] 
cDNA sequence showed that introns are all absent in these genes. Ahead of the 
original codon ATG and after the stop codon TGA, there are 294-bp, 6-bp and 
6-bp 5'- untranslated regions (UTR). in addition to a 311-bp, 100-bp and 139- 
bp 3'-UTR for C. intermedia var. glabra, P. distinctissima. and P. lakhimpur¬ 
nense, respectively. Each sequence consists of an ORF of 4332 (C. intermedia 
var. glabra), 4308 (P. distinctissima) and 4317 bp (P. lakhimpurnense), which 
is very similar to that of AcNEO (4398 bp). Their guanine-cytosine (GC) 
contents are 59.5%, 61.0% and 58.7%. respectively. The three genes each 
encode a deduced protein of 1443, 1435 and 1438 amino acid residues with a 
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2000bp 
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500bp 

250bp 

lOObp 


MI2M3M4M5M6 




Ml M2M3M1 M5 



M1M2M3 M1M5 M6 



Fig. 1. Agarose gel electrophoresis of amplification products for Coniogramme intermedia, var. 
glabra (a), Plagiogyria distinctissima (b), and Pronephrium lakhimpurnense NEO fragments (c). M 
is molecular weight marker (DL2000). al and a2: the products amplified by the primer pairs GSP 
Fl/Rland GSP F3/R3: a3: the products amplified by the primer pairs GDP F2/R2; a4 and a5: the 
products amplified by the first and second IPCR using the primer pairs 5' GDP Fl/5' GSP Rl and 5' 
GSP F2/5' GSP R2, respectively: a6; the amplification products of SON PGR by the primer 3' GSP 2. 
hi: the products amplified by the primer pairs 5' GSP F3/5' GSP R3; b2: the products amplified by 
the primer pairs GSP Fl/Rl: b3: the products amplified by the primer pairs GDP F2/R2; b4: the 
products amplified by the primer pairs GSP F3/R3; b5: the amplification products of SON PGR bv 
the primer 3' GSP 2. cl: the products amplified by the primer pairs 5' GSP F3/5' GSP R3: c2: the 
products amplified by the primer pairs GSP Fl/Rl; c3: the products amplified by the primer pairs 
GDP F2/R2: c4: the products amplified by the primer pairs GSP F3/R3; c5 and c6: the amplification 
products of the first and second IPCR by the primer pairs 3' GSP F2/ R2 and 3'GSP F3/ 
R3. respectively. 


theoretical isoelectric point (pi) of 6.44, 6.27 and 6.28 and a calculated 
molecular mass about 160.15, 158.51 and 158.28kDa. 

Amino acid compositions of the three deduced proteins are very similar to 
each other. The richest amino acid is Leu (9.8~10.5% by frequency), followed 
by Ala (8.0~8.7%), Gly (7.8~8.7%), Ser (7.3~7.8%) and Arg (7.1-7.5%). 
Acidic and basic amino acids constitute about 12% and 11% of the protein 
separately. Of the total amino acids, 23% are polar while 32% charged. The 
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Ph\'Tochrome 


Phototropin 


MI 



COOH 


Fic. 2. The common structure of NEO in fern species. It shows that the N-terminal phytochrome 

chromophoro domain contains PAS, GAF, and PHY domains and C-terminal phototropin motif 
posses.ses two LOV and one STKc domains. 


deduced NEO proteins were predicted to have a net charge of -5.1, -9.2, and 
— 7.9 at pH 7.0. They each contain 488, 495 and 494 hydrophobic amino acids 
(33.8%, 34.5% and 34.4% by frequency), and are all predicted to have a 
number of typical hydrophobic regions identified by Kyte and Doolittle 
hydropathy analyses (Gasteiger et ai, 2005). Each protein possesses several 
helices thought to be transmembrane domains with the N- terminal domains 
on the intracellular side. However, analysis of the primary structure did not 
reveal any signal peptide, indicating that they are most probably cytoplasmic. 
The results are in accordance with /IcNEO, which is a plasma-membrane 
localized protein (Nozue et al.. 1998). However, its putative association with 
the membrane remains unclear. 

The three genes show 78~80% identitv with AcNEO at the nucleotide acid 
level as well as >88% identity among themselves (Table 3). The N-terminal 
sections of NEO exhibits 40—58% identity with seeded vascular and non- 
vascular plant PHYs, and the C-terminal regions show 40—52% identity with 
plant PHOTs. Protein domain searches displayed that all three proteins 
possess the typical features of fern NEO family. They share three domains of 
phytochrome photosensory core region in the N-terminus and a nearly full 
phototropin homolog in the C-terminus (Fig. 2). Henceforth, we denoted the 
three new fern PHY-PHOT chimera genes as C. intermedia var. glabra NEO 
[CiNEO], P. distinctissima NEO [PdNEO], and P. lakhimpurnense NEO 
{PINEO), respectively. 

By contrast, the sequences of the chimeric photoreceptors are not conserved 
between the fern and alg ae. Fern NEOs are all distantly related to Mougeotia 
scalaris NEO [MsNEO]l and MsNEOl with <27% identity (Table 3). MsNEOl 
and MsNE02 exhibit lower identity to seeded vascular and non-vascular plant 
PHYs (<46%) and PHOTs (<45%) compared with fern NEO. In combination 
with the results that fern NEO genes are intronless but MsNEOl and MsNE02 
have many introns (Nozue et al.. 1998; Suetsugu et al., 2005), we also speculate 
that fern and alga NEO arose independently and evolved convergentlv. This is 
in agreement with a previous study (Suetsugu et al., 2005). However, it cannot 
be excluded that the fern NEOs might have experienced intron losses during 
their evolution. 

Phylogenetic analysis. —Fern and alga NEOs each constitute a clade with 
strong statistical support in the ML trees (Fig. 3). They further form a major 
clade in the tree established from the sequences of phytochrome sensor 
domains, while seeded vascular and non-vascular plants (including the 
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Table 3. Percentage of sequence identity among fern and alga NEO, canonical phytochromes, 
and phototropins/^ 



AcNEO 

CiNEO 

PdNEO 

PINEO 

Ms NEO 1 

MsNE02 

AcNEO 

CiNEO 

78.6 










PdNEO 

79..5 


89.4 








PINEO 

78.3 


88.0 

89.1 







MsNEOl 

26.3 


26.5 

26.6 


26.3 





Ms NEO 2 

26.8 


26.8 

27.1 


26.8 

55,9 




Plant PHY*' 

41.2- 

-56.8 

41.1-57.2 

41.2- 

-57.6 

40.7-56.1 

40.6- 

-45.5 

39.9- 

-45.5 

Plant PHOT*' 

40.5- 

-51.7 

40.3-51.3 

40.3- 

-51.9 

40.6-51.3 

33.2- 

-45.1 

33.3- 

-45.0 


^ The sequences of CiNEO, PdNEO and PINEO were obtained by cloning and sequencing in this 
work, and the otliers were downloaded from Genbank database. Complete species names and 
accession numbers were listed in Table 2. 

*’ The alignment with plant PHY and PHOT used the phytochrome-like N-terminal and 
phototropin-like C-terminal region, separately. Plant PHY and plant PHOT were in accordance 
with PHY and PHOT in molecular evolution analysis below. 


angiosperm, gymnosperm, fern, moss, liverwort and alga) comprise another 
(Fig 3a). By contrast, for the ML tree constructed using the sequences of 
phototropin domains (Fig 3b), the fern NEOs were grouped together with 
seeded vascular and non-vascular plant PHOTs rather than alga NEOs. 
However, in each tree none of these clades received strong support. NEO has 
been regarded as the fusion between phytochrome and phototropin motifs 
(Suetsugu et al., 2005). Since a robust phytogeny cannot be reconstructed here, 
we must await more necessary data to resolve the origin of fern NEO genes. 

Predictions of secondary and tertiary' structure. —The predicted secondary 
structures exhibit that the putative NEO polypeptides all contain about 37% 
alpha helices, 17% extended strands, 9% P turns and 37% random coils. 
Alpha helices and random coils constitute the interlaced domain of the main 
part of the secondary structure. 

The predicted 3D structure of functional domains in CiNEO, PdNEO, and 
P/NEO are displayed in Fig. 4. The 3D structure models of PAS and GAF 
domains were constructed with E-values of 2.9E~^®. 1.8E and 1.2E~^’^ for 
C/NEO. PdNEO, and P/NEO, respectively. The PAS and GAF domains of the 
three proteins are highly conserved, which all share the characteristic structure 
of plant phytochromes (Wagner et al.. 2005). The main difference among CiNEO, 
PdNEO, and PiNEO is that Pc/NEO has a longer P-sheet at GAF domain, but 
CiNEO and P/NEO are random coils in that position. However, the principle of 
this difference remains unclear. Except for that, the three proteins each contain a 
figure-eight knot donated by GAF and PAS domains, and they all reserve the 
conserved cysteine residue (Cys306 for CiNEO. Cys300 for PdNEO and Cys301 
for P/NEO) in the GAF domain, which is associated with phytochromobilin 
(P<I)B) by a covalent linkage (Boylan and Quail, 1991). The adjacent histidine 
residue (His307 for CiNEO. His301 for PdNEO and His302 for P/NEO), required 
for tight chromophore-protein interaction, is also conserved. 









10 


AMERICAN FERN JOURNAL: VOLUME 100 NUMBER 1 (2010) 


a 




FpPhyI 

PpPhY3 
CpPhy2 
t pPh> I 
CpPhvJI 
PpPh>5fl 




^ PpPhySc 
^ PpPhySbJ 
PpPh)4 
PpPb>2 

J-'IpPby 

SmPhvI 

w 

AcPhj2 

AcPtiil 



P»PhyO 


l«Wl 


AlPtiyA 
- OsPhvA 



OsPtivC 

# 

UPhvF 

* 

— AtPtiYC 


PsPhyp 

igT NfPIijB 
Q MHhyB 
' ^ UPhyBl 



J2 




NcPhvl 


b 


MsAeo2 


MvNeoI 


Alga Neo 


Fern Neo 


Moss/ 

Liverwort/ 

Alga/ 

Fern 

Phy 



tt.1 




PbPhyB2 

PbPbvBt 

LjPhyB 

OsPhvB 
SbPliyB 

ZmPbvBl 

^ ZinPbvB! 

AtPhvD 
<• 

\tPhvB 
- AiPhyE 
foPhvE 



Angiosperm/ 

Gymnosperm 

Phy 


CrPhoi 





MsNfol 


Alga Neo 




AcNco 






PdNeo 

PINeo 




PpPhot.43 


JtiDv 







PpPbMAI 


liMt 



PpPhoiA 2 

PpPhotB3 
PpPholB2 
PpPhotAI 




AcPhol2 



M«PhoLA 
\UPhotB 


AcPbotl 
O^Npblb 
ItPlMtl 
LePhotl 
h Phol2 

NTphotli 

P^pbotli 

hphotlb 

PsPhotl 
V^faoilb 
LePhotl 
\fPho(l 
ZmPhotl 
AsVpbtl 



I 

OiNphIa 

OiApbl 


Fern Neo 


Fern/ 

Moss/ 

Alga 

Phot 


Angiosperm/ 

Gymnosperm 

Phot 


Fit.. Phylogenetic trees established from nucleotide acid sequences of ph\tochrome sensor 

domains (a) and phototropin domains (b) generated by the maximum likelihood method using 

PHYML software, which are arbitrarily rooted with Neurospora crassa phytochrome (NcPhyl) and 

Cbhinn (lomonus reinhardtii blue light receptor (CrPhot). Numbers above branches or near to the 

riglit of nodes indicate bootstrap values (only values>500 are indicated) obtained from 1000 

rei)ii{;ations. The scale bar represents substitution number per nucleotide site. Neo branches are 
marked with bold dark lines. 


I he LOVl and LOV2 structures of three NEO are nearly identical (Fig. 4b 
and 4c). which is consistent with the high degree of sequence conservation at 
the amino acid level (Fig. 5). The LOV domains all possess the typical a/p fold 
of the PAS family, comprising five stranded anti-parallel P-sheets (Pl—ps) and 
three a-helical segments (al-a3), which are with a backbone P-scaffold and a 
helical connector forming a tight cage enclosing the FMN (Pandini and Bonati, 
2005). All LOV domains contain the highly conserved motif NCRFLQ and 
constitute a single turn of a 3^0 helix (a'A), contributing to the chemistry of 
photoactivation and subsequent downstream signaling (Crosson and Moffat, 
2001). The LOV domains also contain the conserved flavin-binding residues 
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b 


Helical connector 



P-scaffold 


L0V1 


PAS and GAP 



LOV2 


d 



# 


STKc 


Fic. 4. The alignment of the predicted three-dimensional structure of NEO functional domains for 
three fern species Coniogramme intermedia var. glohra (dark gray), Plcigiogyria distinctissima 
(gray) and Pronephriuni lakhimpurnense (light gray), (a) The predicted 3D structure of PAS and 
GAP domains. The ph>dochromobilin (P4>B) linked with cys residues are displayed with sticks. 
The figure-eight knots are marked with black arrow. (b) and (c) The predicted 3D structure of LOVl 
and LOV2 domains, respectively. The FMN ligand linked with cys residues are displayed with 
sticks. (d) The predicted 3D structure of STKc domains. Areas of the P-loop (phosphate-binding 
loop). C-loop (catalytic loop) and A-loop (activation loop) are marked with three black arrows. 


including the photochemically active cysteine residues (Cys712, 982 for 
CiNEO; Cys702, 973 for PdNEO; and Cys707, 977 for P/NEO). Furthermore, 11 
residues that interact with FMN were identified based on the results of the 
flavin-binding domain (Crosson and Moffat, 2001). Residues in a'A and a3 
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C/NEO LOVl 675 
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/»/NEO LOVl 670 
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P/NEO LON 2 992 
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RKAVQL 


LTIAPIRDDSIVKLV 


Fig. 5. Alignment of the eight LOV domains and the identification of flavin-interacting residues of 

Coniogramme intermedia var. glabra, Plagiogyria distinctissima and Pronephrium lakhimpur- 

nense. NEO proteins. Secondary structure of LOV is marked above alignment: p-strand. arrows; 

helix, windings. Dark gray background residues are conserved in all LOV domains. Gray residues 

are conserved in all LOVl domains, whereas light gray residues are conserved in all LOV2 

domains. NCRFLQ residues are specific motif in LOV domains. Residues that interact with FMN 

are marked above with black vertical arrows, whereas variable residues interacting with FMN 
marked bottom with black dots. 


helixes as well as |3-strands 3 and 4 make the majority of flavin contacts. Site- 
directed mutagenesis analysis suggested that the FMN chromophore interact¬ 
ing site Glnl029 in Adkintum (corresponding to Glnl045 in the LOV2 of 
QNEO, Glnl036 in Pc/NEO and Glnl039 in P/NEO) plays an important role in 
the protein structural changes of NEO-LOV2. Most of the FMN interacting 
residues are conserved in the aligned LOV domains with only two exceptions. 
One is the residue 725 (C/NEO LOVl as reference), which is an isoleucine in 
CiNEO and P/NEO LOVl but a valine in other LOVl and all LOV2: the other is 

is a leucine in LOVl but a phenylalanine in LOV2 
(Fig. 5). For Adiantum NEO, it has been suggested that the two amino acid 
differences between LOVl and LOV2 is relative to the distinct roles that they 
play in the photoregulation process (Yamamoto et al.. 2008). 

The NEO kinases of three fern species all have a 7 (/p structure similar to PKA 
(c-AMP-dependent protein kinase A) containing the autophosphorylation site 
(Akamine et al., 2003) as shown in Fig. 4d. Similarly, the NEO STKc domains 
all fold into a two-lobed structure including one smaller N-terminal and one 
larger C-terminal lobes. Based on the information of functional sites in PKA 
(Shaltiel et al., 1998). we examined the kev sites in fern NEO. The N-terminal 
lobe in the STKc domains contains the glycine rich and conserved motif 
GSGDTG as well, which contribute to the P-loop (phosphate-binding loop) of 
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PKA, acting as an ATP-binding site (Akamine et al., 2003). The C-terminal lobe 
contains the typical amino acid sequences of the C-(catalytic) and the A- 
(activation) loop of PKA. In the C-loop regions, all the residues are conserved 
including the phosphorylation reaction catalyzing residue {Aspl253. 
Aspl244, and Aspl247 for C/NEO, PdNEO, and P/NEO, respectively). For 
the A-loop, C/NEO is formed mainly by two P-sheets, Pc/NEO by two oc-helixes 
while P/NEO by one o(-helixe and two small p-sheets. Whether this difference 
contributes to the activation ability remains unknown. By contrast, the Asp 
residue responsible for chelating Mg^^ ions and the consensus phosphoryla¬ 
tion site (Serl313, Serl305 and Serl308 for C/NEO, Pc/NEO, and P/NEO, 
respectively), which have been reported to be required for full kinase activity 
(Tokutomi et al., 2008), are conserved in all the NEO STKc domains. In 
summary, the NEO STKc domains for three fern species well preserve the PKA 
functional motifs, suggesting that NEO may have a molecular mechanism 
similar to PKA. 


Conclusion 

In this study, we have described the isolation and sequence analyses of full- 
length NEO genes from one tree fern and two polypod ferns. This may suggest 
that NEO genes are widely exist in many fern species (at least in tree ferns an d 
polypod ferns). Molecular characterization of the three NEO genes reveals that 
their ORE size, GC content, basic protein properties and the predicted 
secondcuy and 3D structures are very similar to those of Adiantum NEO. In 
fact, the major structural motifs are all identical in fern NEOs, implying that 
the NEO may share a common mechanism for light-sensing in different fern 
species. The minor differences among them may relate to the individual 
diversification. Its N-terminus has three domains with homology to phyto¬ 
chrome and C-terminus contains three domains similar to phototropin. The 
comparisons of the PAS and GAF structures with plant phytochromes and 
STKc domain with PKA also reveal that the NEO well preserves the functional 
motifs. The analysis of the functional domains and key sites relating to both 
regulation and catalyzation may provide useful information on the molecular 
mechanism underlying the photoregulation mediated by NEO proteins. 
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